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Abstract 

We introduce a series of generalized clustering coefficients based on String formalism given 
by Aoyama, using adjacent matrix in networks. We numerically evaluate Milgram condition 
proposed in order to explore q-th degrees of separation in scale free networks and small world 
networks. We find that scale free network with exponent 3 just shows 6-degrees of separa- 
tion. Moreover we find some relations between separation numbers and generalized clustering 
coefficient in both networks. 
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1 Introduction 

"Six degrees of separation" is an attractive subject that was triggered by Milgram [T) half a 
century ago. Six degrees of separation indicates that people have a narrow circle of acquaintances. 
A series of social experiments [2] made the suggestion, in which all people in USA are connected 
through about 6 intermediate acquaintances, more certain. 

We attack this subject based on the string formalism by fusing it into adjacent matrix de- 
scription, especially how cycle structures of networks affect the separation number q [3]. One 
characteristic measure for analyzing cycle structures is the clustering coefficient ]. But it only 
can reflect triangular structures in a network. There are many cycles with a lot of nodes. The 
usual clustering coefficient cannot analyses such structures. So we must introduce some indices in 
order to analyze more diverse cycle structures. 

We found that the formulation which we reformulated by fusing string formalism and adjacent 
matrix straightforwardly makes the generalization of usual (3-th) clustering coefficient that reflects 
only triangle structures in a network [5]. So we could analyze the effects of cycles with any nodes 
on the separation number by evaluating the generalized clustering coefficients. 

The crucial point of the reformulation is to count the number of non-degenerate strings [5], 
which are defined as subgraphs without any multiple edges and/or any cycles in its subgraphs. 
In this paper we first give a systematic way for it and give the expressions of a series of p-th 
generalized clustering coefficient based on the string formalism. Secondly we evaluate Milgram 
condition proposed by Aoyama [6], to analyze the separation number on networks by using the 
reformulation for scale free networks [8] and small- world networks [4]. Lastly we discuss the 
relations between the number of cycle and the separation number, and obtain some mathematical 
relations between them. 

2 STRING FORMALISM AND ADJACENT MATRIX 

We here review the formalism given in [5]. We consider a string-like part of a graph with 
connected j nodes and call it " j- string" . Let Sj be the number of non-degenerate j-string on 
graph. We also define strings homeomorphic to a circle as a non-degenerate string. It is generally 
so difficult to calculate Sj and would be impossible to calculate Sj with j > 7 at present [6]. 

We reformulate the string formalism by utilizing an adjacent matrix A — (a^ ). Generally the 
powers, A 2 ,A 3 ,A 4 --- , of A give information as to respecting that a node connects other nodes 



through 2, 3, 4, • ■ • intermediation edges, respectively. The information of the connectivity between 
two nodes in A n also contains multiplicity of edges, generally. For resolving the degeneracy, we 
introduce new series of matrices R n which give information as to the connectivity of two nodes, 
io and in, through n intermediation edges without multiplicity. We could find that R n is given by 
the following formula [5]; 
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Here the product of the Kronecker delta in the numerator plays role of protecting of non-degeneracies 
and in the denominator is needed to keep closed strings, respectively. (1) makes one count Sj in 
a unified way. We gave the expressions of R 2 ~ R 6 in [5] but the expressions are too complex, 
needed about 2 or 3 pages in A4 size, to describe them in this paper. 

By using its formulation, we can define the usual clustering coefficient which essentially counts 
the number of triangular in a network. Although there are some definitions of the clustering 
coefficient [3J, @], we adopt the usual global clustering coefficient C(3) [3J defined by 



6 x number of triangles 6A3 
^ number of connected triplets S3 ' 



where A q is generally the number of polygons with q edges in a network. But we need more 
indices in order to uncover properties of general polygon structures in a network. From (2), we 
can generalize it to p — th generalized clustering coefficient C (p) straightforwardly; 



2p x number of polygons 2pA p 
number of connected p-plets S p 



C(p) can be systematically evaluated by using Rn as shown the following relations; 

5 p =X)(^ P " 1 )«/2, (4) 
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C(p) = yj^ 1 ' (5) 



As an example, we obtain the following formula for the usual clustering coefficient. 



[R% = [A% - [A^Sif, |U4||=£^, (6) 
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3 APPLICATION TO SIX DEGREES OF SEPARATION 

We analyze general q-th dedrees of separation, according to its formalism. For q-th degrees of 
separation, Aoyama proposed a condition, so-called Milgram condition at network size TV [B]; 

M q = ?Z~0(N). (8) 

This condition means that the number of g-strings per node is nearly equal to the size of the 
considering network and gives a boundary whether q-th degrees of separation is satisfied or not. 
As M q grows larger, q-th degrees of separation is easier to be satisfied. We place the focus on scale 
free networks with the degree distribution of P(k) ~ k 1 and small-world networks with rewiring 
ratio a where we use Newman- Watts model [7] for the construction of small-world networks. 
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Figure 1: Separation number q v.s. log M q /N for scale free networks with several 7. 
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Figure 2: Separation number q v.s. log M q /N for small-world networks with several a 



Fig.l and Fig. 2 show the relation between log 10 M g /iVand q for scale free networks with diverse 
7's and for small-world networks with 0<a<latiV = 200. The interior of rectangles in both 
figures shows the region where (8) is satisfied. In Fig.l log 10 M q /N increases linearly with q. We 
see the four degrees of separation in networks with 7 < 2.5 but do not recognize that nodes are 
linked together for 7 > 3.5 up to six degrees of separation. 7 = 2.75 shows five degrees of separation 
and 7 = 3.0, which many real-world networks have about this value of exponent, just shows six 
degrees of separation. This result is also justified from analyses of A n [5], [5]. 

We observe 4 stripes in Fig. 2, which corresponds q = 2 (below curve)^ q — 4 (above one), 
respectively. Six degrees of separation is mostly observed even at small a > but nodes almost 
do not connect each other in 2 steps, even if a = 1 (random graph). This is consistent with the 
fact that the average path length rapidly decreases with a in small-world networks. 

4 MILGRAM CONDITIONONDITION AND GENER- 
ALIZED CLUSTERING COEFFICIENT 

We explore the relation between Milgram condition and the generalized clustering coefficients. 
By this, we analyze how cycle structure on a network is related with the separation number q. We 
first define the following two quantities; 

X = £c (p) , (9) 
Y EE log 10 M g . (10) 
Fig. 3 and Fig. 4 show the relation between X and Y in scale free networks with 1.8 < 7 < 4.0 and 
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small-world networks with 0<a<latiV = 200. In Fig. 3, we recognize that Y increases linearly 
with X; Y = AX + B. The two constants A and B are determined by numerical simulations and 
the values are described in Fig. 3. 

On the other hand, we observe 4 stripes structure in Fig. 4. The meaning of four stripes is as 
same as Fig. 2. In this cases we find that Y is a logarithmic function of X by least squares method 
with the square of multiple correlation coefficient R 2 > 0.995, so the relation Y = DlogX + E 
where D is a negative number holds for each q. The parameters D and E are also determined by 
numerical simulations but the details of the values are not described, since they are not crucial in 
this discussion. So it is noticed that there are the relations of an exponential function for scale 
free networks and an power low for small-world networks between log 10 M q /N and the sum of the 
generalized clustering coefficient, respectively; 

9 

M q ~ exp(c^C( p )) for Scale free networks, (11) 

p=3 

<7 

M q - C^2 C (p)r d for Small-world networks, (12) 

p=3 

where c and d are some constants determined by A and B and determined by D and F, respectively. 
This means that the effect of C(p), that is to say, cycle structures, has a greater influence on the 
scale free networks than small- world ones. In scale free networks, a crucial factor that decreases 
the separation number is hubs and the number of cycles is actually very few and cycles only 
appear incidentally. So it is considered that the effect of C(p) corresponding to short cuts on 
Milgram condition would be relatively serious. On the other hand, a crucial factor that decreases 
the separation number is short cuts, which do not drastically affect cycle structures consist of few 
nodes in Newman- Watts model, in small- world networks. Moreover an increase in C(p) with small 
p disturbs the transmission of a letter or information in small-world networks. This is in striking 
contrast to the cases of scale free networks where an increase in C{p) enhances the transmission. 
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This means that the effect of C(p) on transmission of information strongly depends on the basic 
topology of networks. 

Thus q greatly depends on the sum of C(p)(0 < p < q), which represents the state of the cycle 
structures up to q. This indicates that generalized clustering coefficients introduced in this article 
are effective indices to explore q-th degrees of separation. 



5 SUMMARY 

We attacked general q-th degrees of separation based on the string formalism that is reformu- 
lated by adjacent matrix description, especially how cycle structures on networks, whose informa- 
tion is charged with generalized clustering coefficients, affect the separation number q. 

We see that the scale free network with 7 = 3.0, which many real-world networks have about 
this value of exponent, just shows six degrees of separation and the four degrees of separation in 
networks with 7 < 2.5 but we do not recognize that nodes are linked together for 7 > 3.5 up to 
six degrees of separation. 

Six degrees of separation is mostly observed at small- world networks even with small a but nodes 
almost do not connect each other in 2 steps, even if a — 1 (random graph). This is consistent with 
the fact that the average path length rapidly decreases with a in small-world networks. 

There are a distinct difference between scale free networks and small-world networks in the 
relation between generalized clustering coefficients and q. We found that q at scale free networks 
depends on generalized clustering coefficients more strongly than small-world networks. 

It would become an important issue how the basic topology of networks depends on the relation 
between generalized clustering coefficients and q in more general cases. 
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